Abstract:

The kinetic behavior of a three-dimensional off-lattice heteropolymer model is studied in
terms of the time dependence of the average mean-square displacement between configurations.
It is found that at short time scales similar behavior is obtained even for sequences
with very different thermodynamic properties. Furthermore, the degree of cooperativity in
the folding process is examined by studying the residual number of degrees of freedom
contributing to the structural fluctuations, obtained from an eigenvalue analysis of the
correlation matrix. In the compact state, a gradual decrease in this effective number of
degrees of freedom takes place as the temperature is lowered. This can be interpreted as an
increasing asymmetry of the energy landscape.
1 Introduction



In order to understand the characteristics of the energy landscape and to quantify the requirements
for a protein to have a thermodynamically stable yet kinetically accessible native state, the introduction of
simple models [?] is necessary. In these models the small length and time scales are effectively averaged out,
and the thermodynamics of the resulting coarse-grained chain is described by an effective potential. The
basic assumption is that a low-resolution description still captures the features essential for describing
folding properties.

Much-studied models of this kind are the lattice models, where the protein is represented as a chain
of beads on a cubic lattice (see references in [?]). This approach has indeed proven to be very useful,
and one is able to describe several non-trivial aspects of the folding process. However, it is important to
study alternative off-lattice models, both in order to understand the limitations of the lattice models and
in their own right. In this work a 3D off-lattice model suggested in [?] is used. The model contains two
types of amino acids, hydrophobic and hydrophilic, and the formation of a hydrophobic core is induced by
a sequence-dependent Lennard-Jones potential. In [?] the studies focused on the thermodynamic
behavior of this model.

For a sequence to be a good folder it must satisfy both thermodynamic and kinetic requirements. That is, the
ground state must be stable against thermal fluctuations, which happens at low temperatures,
and yet be kinetically accessible, which happens at higher temperatures. Therefore only sequences for
which the thermodynamic stability persists at high enough temperatures can be classified as good folders.
To understand the behavior of the structural fluctuations in this context, the overall shape diffusion at
short times is studied, as well as the collective behavior of the conformational correlations in the chain.

This letter is organized as follows. Section two defines the model and the observables. Sections three and
four contain the results and a brief summary, respectively.



2 The model 



The model contains two kinds of residues, A and B, which behave as hydrophobic ($\sigma_i = A$) and
hydrophilic ($\sigma_i = B$) residues, respectively. These monomers are joined by rigid bonds $\mathbf{b}_i$ to form a linear
heteropolymer chain living in three dimensions. The shape of the chain is thus specified by the $N-1$ bond
vectors $\mathbf{b}_i$. The energy function is given by

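$$E(\mathbf{b};\sigma) = -\kappa_1 \sum_{i=1}^{N-2} \mathbf{b}_i \cdot \mathbf{b}_{i+1} - \kappa_2 \sum_{i=1}^{N-3} \mathbf{b}_i \cdot \mathbf{b}_{i+2} + \sum_{i=1}^{N-2} \sum_{j=i+2}^{N} E_{LJ}(r_{ij};\sigma_i,\sigma_j) \qquad (1)$$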

where $r_{ij} = r_{ij}(\mathbf{b}_i, \ldots, \mathbf{b}_{j-1})$ denotes the distance between sites $i$ and $j$ of the chain, and $\sigma_1, \ldots, \sigma_N$ is a
binary string that specifies the primary sequence. The species-dependent global interactions are given by
the Lennard-Jones potential,

$$E_{LJ}(r_{ij};\sigma_i,\sigma_j) = 4\,\epsilon(\sigma_i,\sigma_j) \left( \frac{1}{r_{ij}^{12}} - \frac{1}{r_{ij}^{6}} \right) \qquad (2)$$

The depth of the minimum of this potential, $\epsilon(\sigma_i,\sigma_j)$, is chosen to favor the formation of a core of A
residues, i.e. $\epsilon(A,A) = 1$ and $\epsilon(A,B) = \epsilon(B,B) = 1/2$. The two parameters of the energy function, $\kappa_1$
and $\kappa_2$, determine the strength of the species-independent local interactions. The thermodynamic behavior
of this model has been studied in Ref. [?]. It was found that the values $(\kappa_1,\kappa_2) = (-1, 0.5)$ give rise
to local correlations qualitatively similar to those found in functional proteins. This choice favors
anti-parallel nearest-neighbour bonds and parallel next-nearest-neighbour bonds, and will be used throughout
this work. Furthermore, it was found that a gradual compaction occurs as the temperature is lowered. In
the compact state this is then followed by a sequence-dependent folding transition. For this model only
a small fraction of random sequences have good folding properties. In 2D, where a more extensive
investigation was done, only around 10 percent satisfied the folding criteria. This model to some extent
resembles the IMP model [?, ?], which is a Gaussian chain augmented with a Lennard-Jones potential and
an additional quenched disorder term $\sqrt{\epsilon}\,\eta_{ij}/r_{ij}^{6}$ representing a species-dependent interaction. The $\eta_{ij}$
are stochastic variables having zero mean and unit variance, while $\epsilon$ is a measure of the strength of the
quenched disorder.
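As an illustration of eqs. (1) and (2), the following is a minimal Python sketch of the energy evaluation, not the code used for the simulations. The function names and the representation of a conformation as a list of monomer positions are assumptions made here; the default parameters are the values $(\kappa_1,\kappa_2) = (-1, 0.5)$ quoted above.

```python
def epsilon(s_i, s_j):
    """Well depth of eq. (2): 1 for an A-A pair, 1/2 for A-B and B-B pairs."""
    return 1.0 if (s_i == "A" and s_j == "A") else 0.5


def dot(u, v):
    """Scalar product of two 3D vectors given as tuples."""
    return sum(a * b for a, b in zip(u, v))


def energy(positions, sequence, kappa1=-1.0, kappa2=0.5):
    """Energy of eq. (1) for a chain given as a list of 3D monomer positions.

    `positions` and `sequence` are hypothetical inputs: N coordinate tuples
    and a string of N letters 'A'/'B'. Bond lengths are not enforced here.
    """
    n = len(positions)
    # Bond vectors b_i = x_{i+1} - x_i (there are N - 1 of them).
    bonds = [tuple(q - p for p, q in zip(positions[i], positions[i + 1]))
             for i in range(n - 1)]
    # Species-independent local terms.
    e_local = -kappa1 * sum(dot(bonds[i], bonds[i + 1]) for i in range(n - 2))
    e_local -= kappa2 * sum(dot(bonds[i], bonds[i + 2]) for i in range(n - 3))
    # Species-dependent Lennard-Jones terms over pairs with j >= i + 2.
    e_lj = 0.0
    for i in range(n - 2):
        for j in range(i + 2, n):
            r2 = sum((a - b) ** 2 for a, b in zip(positions[i], positions[j]))
            inv6 = 1.0 / r2 ** 3
            e_lj += 4.0 * epsilon(sequence[i], sequence[j]) * (inv6 * inv6 - inv6)
    return e_local + e_lj
```

For a straight chain all bond products are positive, so with $\kappa_1 = -1$ the nearest-neighbour term raises the energy, which is the sense in which this parameter choice favors anti-parallel nearest-neighbour bonds.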

In order to study the kinetics on a rugged energy landscape, some kind of distance measure between
conformations has to be defined. A natural choice [?] is the mean-square displacement between two
configurations $a$ and $b$, $\delta^2_{ab}$:

$$\delta^2_{ab} = \min \frac{1}{N} \sum_{i=1}^{N} \left| \mathbf{x}_i^{(a)} - \mathbf{x}_i^{(b)} \right|^2 \qquad (3)$$

where $\mathbf{x}_i^{(a)}$ ($\mathbf{x}_i^{(b)}$) denotes the position of monomer $i$ in system $a$ ($b$). The minimization is to be performed over
translations, rotations and reflections.

With $\delta_0^2$ denoting the distance to the ground state, and $P(\delta_0^2)$ the corresponding probability distribution,
we can define the probability for the system to be found in the vicinity of the ground state as:

$$p_0 = \int_0^{0.04} P(\delta_0^2)\, d\delta_0^2 \qquad (4)$$

The folding temperature $T_f$ is then defined as the temperature where $p_0 = 1/2$.
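The definition of the ground-state population and of the folding temperature can be sketched as follows. This is only an illustration under stated assumptions: the population is estimated as the fraction of sampled conformations whose distance to the ground state lies below the cutoff 0.04, and the folding temperature is located by linear interpolation between simulated temperatures. Both function names and the interpolation step are choices made here, not taken from the original analysis.

```python
def ground_state_population(delta0_sq_samples, cutoff=0.04):
    """Estimate p_0 as the fraction of samples with delta_0^2 below the cutoff."""
    if not delta0_sq_samples:
        raise ValueError("need at least one sample")
    hits = sum(1 for d in delta0_sq_samples if d < cutoff)
    return hits / len(delta0_sq_samples)


def folding_temperature(temps, p0_values, target=0.5):
    """Linearly interpolate the temperature at which p_0 crosses the target.

    Returns None when p_0 does not cross the target inside the sampled
    temperature range (e.g. a sequence whose T_f lies below all simulated T).
    """
    pairs = sorted(zip(temps, p0_values))
    for (t_lo, p_lo), (t_hi, p_hi) in zip(pairs, pairs[1:]):
        if (p_lo - target) * (p_hi - target) <= 0 and p_lo != p_hi:
            return t_lo + (target - p_lo) * (t_hi - t_lo) / (p_hi - p_lo)
    return None
```

A return value of None corresponds to the situation where only an upper bound on $T_f$ can be quoted.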



3 Results 

We use six sequences of length $N = 20$ (Table 1), chosen in [?] to represent a variety of behavior, and
examine their kinetic properties for short (intermediate) times. That is, the times should be large enough
so as not to depend on details of the MC method (here the normal Metropolis algorithm) but smaller than
the time scales necessary to equilibrate the systems. For each system O(100) Monte Carlo runs, each
consisting of $3.3 \cdot 10^5$ steps, were performed. The time averages of the mean-square displacement (eq. 3)



no.   sequence                    T_f
1     BAAA AAAB AAAA BAAB AABB    < 0.15
2     BAAB AAAA BABA ABAA AAAB    < 0.15
3     AAAA BBAA AABA ABAA ABBA    0.23
4     AAAA BAAB ABAA BBAA ABAA    0.22
5     BAAB BAAA BBBA BABA ABAB    < 0.15
6     AAAB BABB ABAB BABA BABA    0.15

Table 1: The six sequences studied. The errors in T_f are approximately 0.02.
Figure 1: $p_0$, the probability for the system to be found close to the native state, and $r_{gyr}^2$, the radius
of gyration squared, as functions of temperature.

$\langle \delta^2 \rangle_t$ are obtained by averaging over consecutive subsets of conformations with temporal extension $t$ (this
corresponds to an average time separation of $\approx t/3$). The simulations were performed at three different
temperatures, $T = 0.15$, $0.23$ and $0.4$. The two lowest correspond to the folding temperatures of sequences 6
and 3, respectively (see Table 1), whereas $T = 0.4$ is low enough to correspond to compact states but high
enough that the ground state is almost completely depopulated. This can be seen from Fig. 1 (data
from [?]), in which the population of the ground state $p_0$ and the radius of gyration (squared) are plotted
versus temperature.

In Fig. 2 the time dependence of $\langle \delta^2 \rangle_t$ is displayed for short times. The data are well parametrized by a
behavior of the type

$$\langle \delta^2 \rangle_t \propto t^{\nu} \qquad (5)$$

with a temperature-dependent exponent $\nu$. For a harmonic chain one has $\nu = 1/2$, while the $T \to \infty$
limit gives $\nu = 1$ [?]. At short times the high-temperature limit is valid also for this model. For the
sequences examined here, $\nu$ grows with temperature (Table 2) in a roughly sequence-independent manner.
At a fixed (absolute) temperature there is a small difference, in that the best folder is the chain with the
slowest kinetics. This can partly be attributed to the fact that this system spends a lot of time in the
vicinity of the native state, although this effect should mainly affect the prefactor in eq. 5. On the other



Figure 2: $\langle \delta^2 \rangle_t$ as a function of $t$ for sequences no. 1 and 3 (horizontal axes: ln(t), from 7 to 13). The data
correspond to, from top to bottom, the temperatures T = 0.4, 0.23 and 0.15. The solid lines are linear fits.

hand, when comparing the systems at the same "physical" temperature, the difference is larger and in the
other direction. The best folder now has the fastest kinetics. That is, when comparing the sequences
at temperatures chosen such that the chains spend an equal amount of time in the native state, high
no.   T = 0.15   T = 0.23   T = 0.4
1     0.31       0.41       0.58
2     0.31       0.40       0.57
3     0.27       0.36       0.56
4     0.29       0.37       0.56
5     0.32       0.42       0.59
6     0.30       0.40       0.58

Table 2: The exponent $\nu$ in eq. 5 for the different sequences at different temperatures. The errors are
approximately 0.03.



thermodynamic stability implies fast kinetics. This means that the crucial requirement for
a sequence to be a good folder is thermodynamic stability of the ground state. The more stable
the ground state is, the faster the kinetics towards it at the folding temperature.
This is in line with what was found in [?]. With a somewhat different distance measure, a similar study was
performed for the IMP model in [?]. That investigation focused on the behavior of the exponent $\nu$
at fixed temperature when the strength $\epsilon$ of the quenched disorder was changed. The results ranged from
$\nu \approx 2/3$ for the ordered system ($\epsilon = 0$) to $\nu \approx 1/2$ for a chain with strong quenched disorder ($\epsilon = 10$). The
difference between different realizations of the couplings $\eta_{ij}$ at a fixed value of $\epsilon$ was found to be quite small,
although no distinction was made between these realizations in terms of their thermodynamic properties.
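The extraction of the exponent in eq. 5 from measured mean-square displacements can be sketched as a straight-line fit in log-log coordinates. The sketch below assumes an ordinary unweighted least-squares fit; the function name `fit_exponent` is hypothetical and the actual fitting procedure behind Table 2 may differ (for instance in how errors are weighted).

```python
import math


def fit_exponent(times, msd):
    """Least-squares fit of ln(msd) = nu * ln(t) + ln(prefactor).

    Returns (nu, ln_prefactor). `times` and `msd` are equally long sequences
    of positive numbers; at least two distinct times are needed.
    """
    xs = [math.log(t) for t in times]
    ys = [math.log(d) for d in msd]
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two data points")
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("all times are identical")
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    nu = sxy / sxx
    return nu, y_mean - nu * x_mean
```

In this parametrization the slope carries the exponent, while effects such as the time spent near the native state enter mainly through the intercept, i.e. the prefactor.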

Next we study how the structural fluctuations decay as the temperature is lowered. All thermal
averages were obtained with the "simulated tempering" method [?]. The torsional angles are defined by
$\cos \phi_k = (\mathbf{b}_k \times \mathbf{b}_{k+1}) \cdot (\mathbf{b}_{k+1} \times \mathbf{b}_{k+2})$. In Fig. 3 the fluctuation in these angles, defined by $\langle \phi_k^2 \rangle - \langle |\phi_k| \rangle^2$, is
shown as a function of temperature. The lines represent a decreasing sequence of temperatures ranging from
T = 1.67 (top) to T = 0.15 (bottom). As can be seen, the sequence with good folding properties (seq. 3)




Figure 3: Fluctuation of torsional angles for sequences no. 1 and 3. The data correspond to, from top to
bottom, the temperatures T = 1.67, 0.41, 0.23 and 0.15.

has a more drastic "freeze out" of the (torsional) degrees of freedom, and at the lowest temperature,
where $p_0 \approx 0.9$, only a few (3) of the torsion angles have significant fluctuations. For sequence 1, on the
other hand, $p_0(T = 0.15) \approx 0.25$, i.e. the thermodynamic stability requirement for the ground state is not
yet satisfied, and all the angles show rather large thermal fluctuations. These measurements do not, however,
provide us with any information concerning the amount of correlations present in this process. In order to
examine this issue we estimate an effective number of degrees of freedom for the chain, by calculating the
Figure 4: a) The "effective size" of the chain as a function of temperature. b) The scalar product of the
leading eigenvector at T with the one at T = 0.15. The data correspond to seq. 1 (x), seq. 7 (o) and seq.
16 (□) respectively.



eigenvalues of the correlation matrix

$$\rho_{ij} = \langle \mathbf{b}_i \cdot \mathbf{b}_j \rangle \qquad (6)$$

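We define an "effective size" of the chain as

$$N_{\mathrm{eff}} = \frac{1}{\lambda_0} \sum_i \lambda_i = \frac{\mathrm{Tr}\,\rho}{\lambda_0} \qquad (7)$$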

where $\lambda_0$ is the largest eigenvalue of $\rho$. For this particular choice of correlation matrix the trace equals
$N-1$. As can be seen from Fig. 4a, there is a gradual decrease in the number of effective degrees of
freedom as the temperature is lowered. This is in contrast to the behavior expected from a quadratic
potential, for which $N_{\mathrm{eff}}$ would have been temperature independent. Thus the energy landscape becomes
more and more asymmetric as the temperature goes down. Comparing with the behavior of $r_{gyr}$ and $p_0$ in
Fig. 1, we see that this "freezing out" effect occurs mainly in the compact state but before the ground state
is populated. In order to see how high in temperature the asymmetry defined at the lowest temperature
persists, we show in Fig. 4b the scalar product between the dominant (normalized) eigenvector at T = 0.15,
$\mathbf{e}_0(0.15)$, and the corresponding one at temperature T, $\mathbf{e}_0(T)$. This is a measure of how similar
the direction specifying the largest structural fluctuations is to the principal direction of the
"valley" hosting the native state, or rather the dominating "valley" at T = 0.15. For seq. 3 and 4 at
least these coincide. As can be expected, at high temperatures there is no memory of this direction, and
although the qualitative behavior is similar for the different chains, the sequence dependence is rather large
compared to that of $N_{\mathrm{eff}}$. This reflects the fact that $\mathbf{e}_0(0.15) \cdot \mathbf{e}_0(T)$ is directly related to the properties of
the low end of the energy spectrum, while $N_{\mathrm{eff}}$ is more general in character.
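The eigenvalue analysis can be illustrated with a small, dependency-free Python sketch. It assumes that the effective size is the trace of the bond correlation matrix divided by its largest eigenvalue, and it uses plain power iteration to find that eigenvalue; both the function names and the choice of power iteration are assumptions made here for illustration, and any standard symmetric eigensolver would serve equally well.

```python
def correlation_matrix(bond_samples):
    """rho_ij = <b_i . b_j>, averaged over sampled conformations.

    `bond_samples` is a list of conformations, each a list of N-1 bond
    vectors given as 3-tuples (a hypothetical input format).
    """
    m = len(bond_samples[0])
    rho = [[0.0] * m for _ in range(m)]
    for bonds in bond_samples:
        for i in range(m):
            for j in range(m):
                rho[i][j] += sum(a * b for a, b in zip(bonds[i], bonds[j]))
    return [[v / len(bond_samples) for v in row] for row in rho]


def largest_eigenvalue(matrix, max_iter=10000, tol=1e-12):
    """Power iteration for a symmetric positive semi-definite matrix.

    The fixed start vector must not be orthogonal to the leading eigenvector.
    """
    m = len(matrix)
    vec = [1.0 + 0.01 * i for i in range(m)]
    lam = 0.0
    for _ in range(max_iter):
        new = [sum(matrix[i][j] * vec[j] for j in range(m)) for i in range(m)]
        norm = sum(v * v for v in new) ** 0.5
        if norm == 0.0:
            return 0.0
        vec = [v / norm for v in new]
        # Rayleigh quotient of the normalized iterate.
        mv = [sum(matrix[i][j] * vec[j] for j in range(m)) for i in range(m)]
        new_lam = sum(v * w for v, w in zip(vec, mv))
        if abs(new_lam - lam) < tol:
            return new_lam
        lam = new_lam
    return lam


def effective_size(rho):
    """Trace of rho divided by its largest eigenvalue."""
    lam0 = largest_eigenvalue(rho)
    if lam0 <= 0.0:
        raise ValueError("largest eigenvalue must be positive")
    return sum(rho[i][i] for i in range(len(rho))) / lam0
```

Two limiting cases show the meaning of the measure under this assumption: if all bonds always point in the same direction, every entry of the matrix is 1 and the effective size is 1, whereas for mutually uncorrelated unit bonds the matrix is the identity and the effective size equals the number of bonds.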



4 Conclusions 



We have studied the kinetic behavior of a 3D off-lattice protein model in terms of shape diffusion at 
short time scales in connection with folding. We find that sequences with very different thermodynamic 
behavior, e.g. with respect to the stability of the ground state, have fairly similar kinetic behavior at short 
time scales. This suggests that the crucial requirements for a chain to have good folding properties are 
mainly thermodynamic in nature. Furthermore, we examine the behavior of the structural fluctuations of 
the system - how large are the correlations in the process of freezing out the configurational fluctuations? 
By estimating an effective number of degrees of freedom present in the system, we find that the collective
effects in this process are indeed very strong and that the asymmetry of the energy landscape increases
as the temperature is lowered.